HMM-based Whisper Recognition using μ-law Frequency Warping
نویسندگان
چکیده
منابع مشابه
Frequency Warping for Speaker Adaptation in HMM-based Speech Synthesis
Speaker adaptation in speech synthesis transforms a source utterance to a target utterance that differs from the source in terms of voice characteristics. In this paper, we employ vocal tract length normalization, which is generally used in speech recognition to remove individual speaker characteristics, to speaker adaptation in speech synthesis. We propose a frequency warping approach based on...
متن کاملOn combining frequency warping and spectral shaping in HMM based speech recognition
Frequency warping approaches to speaker normalization have been proposed and evaluated on various speech recognition tasks [1, 2, 3]. These techniques have been found to signi cantly improve performance even for speaker independent recognition from short utterances over the telephone network. In maximum likelihood (ML) based model adaptation a linear transformation is estimated and applied to t...
متن کاملA New Fast and Efficient HMM-Based Face Recognition System Using a 7-State HMM Along With SVD Coefficients
In this paper, a new Hidden Markov Model (HMM)-based face recognition system is proposed. As a novel point despite of five-state HMM used in pervious researches, we used 7-state HMM to cover more details. Indeed we add two new face regions, eyebrows and chin, to the model. As another novel point, we used a small number of quantized Singular Values Decomposition (SVD) coefficients as feature...
متن کاملHMM-Based Action Recognition Using Contour Histograms
This paper describes an experimental study about a robust contour feature (shape-context) for using in action recognition based on continuous hidden Markov models (HMM). We ran different experimental setting using the KTH’s database of actions. The image contours are extracted using a standard algorithm. The shape-context feature vector is build from of histogram of a set of non-overlapping reg...
متن کاملPitch Mean Based Frequency Warping
In this paper, a novel pitch mean based frequency warping (PMFW) method is proposed to reduce the pitch variability in speech signals at the frontend of speech recognition. The warp factors used in this process are calculated based on the average pitch of a speech segment. Two functions to describe the relations between the frequency warping factor and the pitch mean are defined and compared. W...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: SPIIRAS Proceedings
سال: 2018
ISSN: 2078-9599,2078-9181
DOI: 10.15622/sp.58.2